CDS

Accession Number TCMCG078C22901
gbkey CDS
Protein Id KAG0491536.1
Location complement(join(15816960..15817103,15817175..15817520,15817599..15817744,15817830..15817931,15818004..15818111,15818194..15818303,15818407..15818516,15818610..15818794,15818876..15819040,15819113..15819231,15819297..15819381,15819447..15819552,15819642..15819736,15819913..15820056,15820143..15820235,15821600..15821666,15821749..15821861,15821963..15822058,15822167..15822331))
Organism Vanilla planifolia
locus_tag HPP92_004934
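
The Location field above lists 19 exon intervals on the complement strand. As a consistency check, the summed exon lengths can be compared with the protein length reported in this record. The following is a minimal sketch, assuming 1-based inclusive coordinates (the usual GenBank convention); the helper names `exon_spans` and `cds_length` are hypothetical and not part of any database tooling.

```python
import re

# Location string copied from this record's CDS section.
LOCATION = (
    "complement(join(15816960..15817103,15817175..15817520,"
    "15817599..15817744,15817830..15817931,15818004..15818111,"
    "15818194..15818303,15818407..15818516,15818610..15818794,"
    "15818876..15819040,15819113..15819231,15819297..15819381,"
    "15819447..15819552,15819642..15819736,15819913..15820056,"
    "15820143..15820235,15821600..15821666,15821749..15821861,"
    "15821963..15822058,15822167..15822331))"
)


def exon_spans(location):
    """Return (start, end) pairs in the order listed in the location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def cds_length(location):
    """Sum exon lengths, treating coordinates as 1-based and inclusive (assumption)."""
    return sum(end - start + 1 for start, end in exon_spans(location))


spans = exon_spans(LOCATION)
total_nt = cds_length(LOCATION)

# Codons minus one stop codon should equal the reported protein length.
print(len(spans), total_nt, total_nt // 3 - 1)  # prints: 19 2499 832
```

The total of 2499 nt corresponds to 833 codons, that is, 832 amino acids plus a stop codon, which agrees with the Length field in the Protein section.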

Protein

Length 832 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000002.1
Definition hypothetical protein HPP92_004934 [Vanilla planifolia]
Locus_tag HPP92_004934

EGGNOG-MAPPER Annotation

COG_category G
Description Beta-galactosidase 11
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0003674
GO:0003824
GO:0004553
GO:0004565
GO:0005575
GO:0005618
GO:0005622
GO:0005623
GO:0005737
GO:0005773
GO:0015925
GO:0016787
GO:0016798
GO:0030312
GO:0043226
GO:0043227
GO:0043229
GO:0043231
GO:0044424
GO:0044444
GO:0044464
GO:0071944

Sequence

CDS:  
ATGCAGACCGCGCTCCTCTCCCTCATCTTCGCCGCCGCCTTCGTAGTCTCCGCTCACGGCAAGGGCGAGACGCCAGTTACCTACGACGCGCGCTCTCTCATCATCAATGGAAAGCAGGAACTCCTCTTGTCCGGATCCGTTCATTATCCACGCAGCACCCCGGAGATGTGGCCTGGTATCATTGCCAAGGCCAAAATCGGTGGGCTCAATGTGATCCAAACTTATGTTTTCTGGAATGTGCATGAACCTCTTCAGGGAAAGTACAACTTTGAGGGAAGATACGATTTGGTGAAGTTTGTCAAGTTGATTCAACAAAATGGTATGTATGTGAACCTAAGGATTGGGCCATTCATTCAAGCCGAATGGAACCATGGAGGATTTCCTTTCTGGCTAAGGGAAGTTAAGGATATCACCTTTCGCACAAACAATCCACAGTTTAAGTACCACATGGAGAAGTTTGTGAGGAAGATTGTTAACATGATGATGGAAGAGAAGCTCTTTGCCTCGCAAGGAGGGCCAATCATCCTAGCTCAGATCGAGAATGAGTACAACATGGTGCAAGCTGCTTTCAAGGAAGGAGGAAAGAAGTACATTAAGTGGGCATCTGAGATGGCCATTAGCCTTGGGACTGGAGTTCCATGGATGATGTGCAAGCAGCAAGATGCTCCCGGTCCAGTGATCAATGCATGCAATGGAAGAAACTGTGGTGACACATGGATAGGTCCAAATCATCCAACCAAGCCTCTTGTCTGGGCTGAGAACTGGACTGCACAGTACCGGGTGTTTGGAGATCCTCCCTCACAAAGGTCAGCTGAGGACCTTGCTTACTCTGTTGCTCGCTTCTTCTCCAAGAATGGCACCCTTGCAAACTACTACATGTACCACGGAGGAACCAACTTTGGTAGAACTGGTTCTGCTTACGTCTCGACTCGGTACTATGACGAAGCACCATTGGATGAATATGGCATGCAAAGGGAGCCGAAATGGGGACACCTGAAGGATTTGCACCAAGCTCTGAAATTGAGCAAGAAAGCTCTCCTCTGGGGCTCTGACAGTTTGGTCCCACTGGGGAATAGTCTTGAGGCGAGAGTGTATGAGATTCCTAGCCAAAAAGTGTGCACTGCCTTCCTTACAAATACTCATCAGGCAGATGCCACAGTTAACTTCCGTGGTGTGGAGTACTTCTTGCCTCGCCGCTCTATCAGCATACTCCCAGATTGCAGAACTGTGGTGTACAACACTCAAAGGGTGAATGCTCAGCACAACTCAAGGAGCTTCATTCCATTTGTGGAGACCAAAAAGAAGCTCAAATGGAAGATGTACAGAGACCACATCCCAAGATACCAAGACCAAGGAATTATTCACAATTCAGAACCTTTGGAGCTCATGAAAACTACCAAGGATACTACTGACTACCTCTGGTACACTACAAGCTTCAAGCTTGATCAAGAGGACCTGCCCATGAGGCATGACATCAGTCCAGTGCTTCAAATTTCCAGCCTTGGCCACGCGGTGCATGCATTTGCCAATGGGAAGTACCTTGGAAATGCACATGGAAGCAAGATAGAGAAAAGCTTCGTCTTCAAAAAACCGATGAAGCTGCATACTGGAACCAACCACATCACCATTCTTGCAATGACCGTTGGCTTCCCGGACAGCGGGGCCTATCTGGAGCATAGGATGGCTGGAGTTCACTCAGTACACGTTCAGGGTCTTAACACTGGTACCCTTGACCTCACCAGAAATGGATGGGGACATCAGGTTGGTCTCATTGGTGAGAAACTCCAGATCTACACGAAAGAAGGAGCCAATAGAATTCAATGGAGCAAGTCCGAGAGGAACGTACCCATTACTTGGTACAAGAGATACTTCGATGCCCCCCCAGGCGACGATCCTGTTGTGTTCGACATGAGCTCCATGACGAAAGGGCTTGCGTGGGTTAATGGAGAATGCATTGGTAGATACTGGGTTTCGAACGTCTCACCTCTCGGCACACCCAC
TCAGACTGTGTACCATGTACCTCGTGCATTCTTGAAGACAACTGAGAATCTCATGGTCGTCTTCGATGAAACCGGTGGCGACCCAAATGGCATCCGAATCTTGACCGCGAGGAGGGACGATATCTGCACCTACGTGTCCCAGTTCCACCCAAGCTCCATTCGGTCGTGGTCGAGAAAGGAAGGGCAGCTCATCTCTTCGGTGGAGGACGTGAAGCCTGAGGCACACCTGAAGTGCCCCAAATCGAAGGTCATCAAGTCTGTCACCTTCGCGAGCTTCGGCAACCCAAGCGGCATCTGCGGCAACTACACCATTGGAAGCTGCCATGCCCCTCAAACCCAATCCATAGTGGAGAAGGCTTGTCTGGGAAAGAGGTCGTGCGTGCTTCCAGTGAAGGTGGAGGCCTATGGCGCTGACGCGAACTGCCCAGGAACAAAGGCAACGCTCGCAGTTCAGGTCGCGTGCGGTGCGAAGATAGAAAACCATATGCGAATGCTGTGA
Protein:  
MQTALLSLIFAAAFVVSAHGKGETPVTYDARSLIINGKQELLLSGSVHYPRSTPEMWPGIIAKAKIGGLNVIQTYVFWNVHEPLQGKYNFEGRYDLVKFVKLIQQNGMYVNLRIGPFIQAEWNHGGFPFWLREVKDITFRTNNPQFKYHMEKFVRKIVNMMMEEKLFASQGGPIILAQIENEYNMVQAAFKEGGKKYIKWASEMAISLGTGVPWMMCKQQDAPGPVINACNGRNCGDTWIGPNHPTKPLVWAENWTAQYRVFGDPPSQRSAEDLAYSVARFFSKNGTLANYYMYHGGTNFGRTGSAYVSTRYYDEAPLDEYGMQREPKWGHLKDLHQALKLSKKALLWGSDSLVPLGNSLEARVYEIPSQKVCTAFLTNTHQADATVNFRGVEYFLPRRSISILPDCRTVVYNTQRVNAQHNSRSFIPFVETKKKLKWKMYRDHIPRYQDQGIIHNSEPLELMKTTKDTTDYLWYTTSFKLDQEDLPMRHDISPVLQISSLGHAVHAFANGKYLGNAHGSKIEKSFVFKKPMKLHTGTNHITILAMTVGFPDSGAYLEHRMAGVHSVHVQGLNTGTLDLTRNGWGHQVGLIGEKLQIYTKEGANRIQWSKSERNVPITWYKRYFDAPPGDDPVVFDMSSMTKGLAWVNGECIGRYWVSNVSPLGTPTQTVYHVPRAFLKTTENLMVVFDETGGDPNGIRILTARRDDICTYVSQFHPSSIRSWSRKEGQLISSVEDVKPEAHLKCPKSKVIKSVTFASFGNPSGICGNYTIGSCHAPQTQSIVEKACLGKRSCVLPVKVEAYGADANCPGTKATLAVQVACGAKIENHMRML
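
The relationship between the CDS and Protein sequences above can be spot-checked by translating a few codons. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical, and only the first 30 and last 18 nucleotides of the CDS are used here, copied from the record.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a nucleotide string codon by codon, stopping at the first stop codon.

    Raises KeyError on codons containing characters other than A, C, G, T.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 18 nt of the CDS shown in this record.
cds_start = "ATGCAGACCGCGCTCCTCTCCCTCATCTTC"
cds_end = "CATATGCGAATGCTGTGA"

print(translate(cds_start))  # prints: MQTALLSLIF
print(translate(cds_end))    # prints: HMRML
```

Both fragments match the corresponding ends of the Protein sequence. To translate the full CDS, the sequence lines above would first need to be joined into a single string, since the record wraps the CDS across more than one line.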